cd/entity/Rafailov et al.Β· homeβ€Ί entitiesβ€Ί Rafailov et al.
grep -l @rafailov et al. /news/*.json | wc -l β†’ 1

Rafailov et al.

mentions 1 type Person feed RSS

// recent coverage 1 mentions

08:00
2026-07-04
dev.to
large-language-models

DPO vs RLHF: The Alignment Tax You Pay Without Knowing

A developer argues that alignment algorithms like RLHF and DPO impose an 'alignment tax' that degrades model reasoning in favor of sycophantic behavior. The developer claims that both methods optimize…

// co-occurs with top 7 entities